Predictors of Pause Duration in Read-Aloud Discourse
نویسندگان
چکیده
منابع مشابه
Pause duration and variability in read texts
Generating natural sounding synthetic speech from text requires a division of a text into IPs and assigning pauses between those phrases. A difficulty which faces attempts to model pauses quantitatively is high degree of variability exhibited by speakers in pause placement and duration. The present study seeks to investigate if Synchronous Speech (speech elicited when two speakers are asked to ...
متن کاملPause Duration and Variabil
Generating natural sounding synthetic speech from text requires a division of a text into IPs and assigning pauses between those phrases. A difficulty which faces attempts to model pauses quantitatively is high degree of variability exhibited by speakers in pause placement and duration. The present study seeks to investigate if Synchronous Speech (speech elicited when two speakers are asked to ...
متن کاملAcoustical features as predictors for prominence in read aloud dutch sentences used in ANN's
In this paper we present several acoustical features, which are used as predictors for prominence. A set of 1244 sentences from 273 different speakers is selected from the Dutch Polyphone Corpus. Via listening experiments the subjective prominence markers are obtained. Several acoustical features concerning F0, energy and duration are derived and used as predictors for prominence. The sentences...
متن کاملPitch range and pause duration as markers of discourse hierarchy: perception experiments
Discourse structure is reflected by a number of global prosodic parameters, like for example pause duration and pitch range. Discourse structure is also known to affect the accessibility/salience of antecedents of anaphoric expressions. Assuming these generalizations are correct, one can ask whether listeners use the information encoded in pauses and pitch range to resolve anaphoric references ...
متن کاملAlignment Algorithms for Learning to Read Aloud
A complete system of learning spelling-tophoneme conversion of English words consists of three major processes: alignment, mapping learning, and grapheme generation. Such a system can be used to construct prototypes of reading machines for English or other lan guages quickly and automatically. This paper focusses on the alignment process, which is crit ical to mapping learning and grapheme ge...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEICE Transactions on Information and Systems
سال: 2014
ISSN: 0916-8532,1745-1361
DOI: 10.1587/transinf.e97.d.1461